NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Generative Design of Thermoset Shape Memory Polymers Driven by Chemical Group: A Conditional Variational Autoencoder Approach

https://doi.org/10.1002/pol.20240649

Das, Borun; Peters, Andrew; Li, Guoqiang; Hei, Xiali (March 2025, Journal of Polymer Science)

ABSTRACT The discovery of novel thermoset shape memory polymers (TSMPs) for additive manufacturing can be accelerated through the use of a deep‐generative algorithm, minimizing the need for laborious traditional laboratory experiments. This study is the first to introduce an innovative approach that uses a deep generative learning model, namely the conditional variational autoencoder (CVAE), to discover novel TSMPs with lower glass transition temperature () and high recovery stress values (). In this study, specific chemical groups, such as epoxy, amine, thiol, and vinyl, are integrated as constraints to generate novel TSMPs while preserving the essential reaction properties. To address the challenges posed by a small dataset, the CVAE model is used with graph‐extracted features. Unlike previous studies focused on single‐polymer systems, this research extends to two‐monomer samples, discovering 22 novel TSMPs. This approach has practical implications in additive manufacturing, biomedical devices, aerospace, and robotics for the discovery of novel samples from limited data.
more » « less
Free, publicly-accessible full text available March 15, 2026
DP-SGD-global-adapt-V2-S: Triad improvements of privacy, accuracy and fairness via step decay noise multiplier and step decay upper clipping threshold

https://doi.org/10.1016/j.elerap.2025.101476

Chilukoti, Sai Venkatesh; Hossen, Md Imran; Shan, Liqun; Tida, Vijay Srinivas; Bappy, Mahathir Mohammad; Tian, Wenmeng; Hei, Xiali (March 2025, Electronic Commerce Research and Applications)

Differentially Private Stochastic Gradient Descent (DP-SGD) has become a widely used technique for safeguarding sensitive information in deep learning applications. Unfortunately, DP-SGD’s per-sample gradient clipping and uniform noise addition during training can significantly degrade model utility and fairness. We observe that the latest DP-SGD-Global-Adapt’s average gradient norm is the same throughout the training. Even when it is integrated with the existing linear decay noise multiplier, it has little or no advantage. Moreover, we notice that its upper clipping threshold increases exponentially towards the end of training, potentially impacting the model’s convergence. Other algorithms, DP-PSAC, Auto-S, DP-SGD-Global, and DP-F, have utility and fairness that are similar to or worse than DP-SGD, as demonstrated in experiments. To overcome these problems and improve utility and fairness, we developed the DP-SGD-Global-Adapt-V2-S. It has a step-decay noise multiplier and an upper clipping threshold that is also decayed step-wise. DP-SGD-Global-Adapt-V2-S with a privacy budget of 1 improves accuracy by 0.9795%, 0.6786%, and 4.0130% in MNIST, CIFAR10, and CIFAR100, respectively. It also reduces the privacy cost gap by 89.8332% and 60.5541% in unbalanced MNIST and Thinwall datasets, respectively. Finally, we develop mathematical expressions to compute the privacy budget using truncated concentrated differential privacy (tCDP) for DP-SGD-Global-Adapt-V2-T and DP-SGD-Global-Adapt-V2-S.
more » « less
Free, publicly-accessible full text available March 1, 2026
IdentityKD: Identity-wise Cross-modal Knowledge Distillation for Person Recognition via mmWave Radar Sensors

https://doi.org/10.1145/3696409.3700254

Shan, Liqun; Zhang, Rujun; Chilukoti, Sai Venkatesh; Zhang, Xingli; Lee, Insup; Hei, Xiali (December 2024, ACM)

Recent advancements in person recognition have raised concerns about identity privacy leaks. Gait recognition through millimeter-wave radar provides a privacy-centric method. However, it is challenged by lower accuracy due to the sparse data these sensors capture. We are the first to investigate a cross-modal method, IdentityKD, to enhance gait-based person recognition with the assistance of facial data. IdentityKD involves a training process using both gait and facial data, while the inference stage is conducted exclusively with gait data. To effectively transfer facial knowledge to the gait model, we create a composite feature representation using contrastive learning. This method integrates facial and gait features into a unified embedding that captures the unique identityspecific information from both modalities. We employ two distinct contrastive learning losses. One minimizes the distance between embeddings of data pairs from the same person, enhancing intraclass compactness, while the other maximizes the distance between embeddings of data pairs from different individuals, improving inter-class separability. Additionally, we use an identity-wise distillation strategy, which tailors the training process for each individual, ensuring that the model learns to distinguish between different identities more effectively. Our experiments on a dataset of 36 subjects, each providing over 5000 face-gait pairs, demonstrate that IdentityKD improves identity recognition accuracy by 6.5% compared to baseline methods.
more » « less
Full Text Available
In-Progress: Enhancing Traffic Signal Perception for Connected and Autonomous Vehicles (CAVs) via Multi-Sensor Fusion of Camera, LiDAR, Radar, and SPaT Data

https://doi.org/10.1109/SPW67851.2025.00052

Sazzadul_Alam, A_K M; Hei, Xiali; Zhang, Yunpeng (May 2025, IEEE)

Free, publicly-accessible full text available May 15, 2026
LiveGuard: Voice Liveness Detection via Wavelet Scattering Transform and Mel Spectrogram Scaling

https://doi.org/10.1109/DSN64029.2025.00041

Shan, Liqun; Zhang, Xingli; Hossen, Md Imran; Hei, Xiali (June 2025, IEEE)

Voice-controlled interfaces are essential in modern smart devices, but they remain vulnerable to replay attacks that compromise voice authentication systems. Existing voice liveness detection methods often struggle to distinguish human speech from replayed audio. This paper introduces a novel approach, LiveGuard, utilizing wavelet scattering transform (WST) and Mel spectrogram scaling with a lightweight ResNet architecture to enhance voice liveness detection. WST captures robust hierarchical features, while Mel spectrogram scaling extracts fine-grained acoustic details, which the lightweight ResNet efficiently processes to identify live voice. Experimental results demonstrate accuracy improvements of 6% with WST and Mel spectrogram scaling, achieving a top accuracy of 97.17% on POCO dataset. Meanwhile, LiveGuard demonstrates superior performance on ASVspoof2019 and ASVspoof2021 benchmarks. It achieves the lowest equal error rate (EER) of 0.13%, and a min t-DCF of 0.00126 on ASVspoof2019, and an EER of 0.42% on ASVspoof2021, surpassing state-of-the-art methods.
more » « less
Free, publicly-accessible full text available June 23, 2026
Single image multi-scale enhancement for rock Micro-CT super-resolution using residual U-Net

https://doi.org/10.1016/j.acags.2024.100165

Shan, Liqun; Liu, Chengqian; Liu, Yanchang; Tu, Yazhou; Chilukoti, Sai Venkatesh; Hei, Xiali (June 2024, Applied Computing and Geosciences)

Micro-CT, also known as X-ray micro-computed tomography, has emerged as the primary instrument for pore-scale properties study in geological materials. Several studies have used deep learning to achieve super-resolution reconstruction in order to balance the trade-off between resolution of CT images and field of view. Nevertheless, most existing methods only work with single-scale CT scans, ignoring the possibility of using multi-scale image features for image reconstruction. In this study, we proposed a super-resolution approach via multi-scale fusion using residual U-Net for rock micro-CT image reconstruction (MS-ResUnet). The residual U-Net provides an encoder-decoder structure. In each encoder layer, several residual sequential blocks and improved residual blocks are used. The decoder is composed of convolutional ReLU residual blocks and residual chained pooling blocks. During the encoding-decoding method, information transfers between neighboring multi-resolution images are fused, resulting in richer rock characteristic information. Qualitative and quantitative comparisons of sandstone, carbonate, and coal CT images demonstrate that our proposed algorithm surpasses existing approaches. Our model accurately reconstructed the intricate details of pores in carbonate and sandstone, as well as clearly visible coal cracks.
more » « less
Full Text Available
From Virtual Touch to Tesla Command: Unlocking Unauthenticated Control Chains From Smart Glasses for Vehicle Takeover

https://doi.org/10.1109/SP54263.2024.00231

Zhang, Xingli; Tu, Yazhou Tu; Long, Yan; Shan, Liqun; Elsaadani, Mohamed A; Fu, Kevin; Lin, Zhiqiang; Hei, Xiali (May 2024, IEEE)

Full Text Available
Towards Adversarial Process Control on Inertial Sensor Systems with Physical Feedback Side Channels

https://doi.org/10.1145/3605758.3623494

Tu, Yazhou; Rampazzi, Sara; Hei, Xiali (November 2023, ACM)

Full Text Available
A reliable diabetic retinopathy grading via transfer learning and ensemble learning with quadratic weighted kappa metric

https://doi.org/10.1186/s12911-024-02446-x

Chilukoti, Sai Venkatesh; Shan, Liqun; Tida, Vijay Srinivas; Maida, Anthony S.; Hei, Xiali (February 2024, BMC Medical Informatics and Decision Making)

Abstract The most common eye infection in people with diabetes is diabetic retinopathy (DR). It might cause blurred vision or even total blindness. Therefore, it is essential to promote early detection to prevent or alleviate the impact of DR. However, due to the possibility that symptoms may not be noticeable in the early stages of DR, it is difficult for doctors to identify them. Therefore, numerous predictive models based on machine learning (ML) and deep learning (DL) have been developed to determine all stages of DR. However, existing DR classification models cannot classify every DR stage or use a computationally heavy approach. Common metrics such as accuracy, F1 score, precision, recall, and AUC-ROC score are not reliable for assessing DR grading. This is because they do not account for two key factors: the severity of the discrepancy between the assigned and predicted grades and the ordered nature of the DR grading scale. This research proposes computationally efficient ensemble methods for the classification of DR. These methods leverage pre-trained model weights, reducing training time and resource requirements. In addition, data augmentation techniques are used to address data limitations, improve features, and improve generalization. This combination offers a promising approach for accurate and robust DR grading. In particular, we take advantage of transfer learning using models trained on DR data and employ CLAHE for image enhancement and Gaussian blur for noise reduction. We propose a three-layer classifier that incorporates dropout and ReLU activation. This design aims to minimize overfitting while effectively extracting features and assigning DR grades. We prioritize the Quadratic Weighted Kappa (QWK) metric due to its sensitivity to label discrepancies, which is crucial for an accurate diagnosis of DR. This combined approach achieves state-of-the-art QWK scores (0.901, 0.967 and 0.944) in the Eyepacs, Aptos, and Messidor datasets.
more » « less
Full Text Available
A First Look at the Security of EEG-based Systems and Intelligent Algorithms under Physical Signal Injections

https://doi.org/10.1145/3591197.3591304

Hossen, Md Imran; Tu, Yazhou; Hei, Xiali (July 2023, SecTL '23: Proceedings of the 2023 Secure and Trustworthy Deep Learning Systems Workshop)

Full Text Available

« Prev Next »

Search for: All records